Constructive Induction On Decision Trees
نویسندگان
چکیده
Selective induction techniques perform poorly when the features are inappropriate for the target concept. One solution is to have the learning system construct new features automatically ; unfortunately feature construction is a difficult and poorly understood problem. In this paper we present a definition of feature construction in concept learning, and offer a framework for its study based on four aspects: detection, selection, generalization, and evaluation. This framework is used in the analysis of existing learning systems and as the basis for the design of a new system, CITRE. CITRE performs feature construction using decision trees and simple domain knowledge as constructive biases. Initial results on a set of spatial-dependent problems suggest the importance of domain knowledge and feature generalization , i.e., constructive induction. 1 Introduction Good representations are often crucial for solving difficult problems in AI. Finding suitable problem representations , however, can be difficult and time consuming. This is especially true in machine learning: learning can be relatively easy if the training examples are presented in a suitable form, but when the features used in describing examples are inappropriate for the target concept, learning can be difficult or impossible using selective induction methods. To overcome this problem a learning system needs to be capable of generating appropriate features for new situations. This paper is concerned with the automated construction of new features to facilitate concept learning, an issue closely related to the "problem of new terms" [Di-etterich et a/., 1982] and constructive induction [Michal-ski, 1983]. We begin by defining "feature construction" in the context of concept learning from examples, and then proceed to identify four inherent aspects: detection, selection, generalization, and evaluation. These aspects comprise an analytical framework for studying feature construction which we describe through examples drawn from the existing systems of BACON.1, BOGART, DUCE, PLSO, and STAGGER. This framework serves as the basis for the design of CITRE, a system that performs feature construction on decision trees using simple domain knowledge. The results of our initial experiments demonstrate CITRE'S ability to improve learning through feature construction in a tic-tac-toe classification problem. Extensions of CITRE for this and other problems are also discussed. 2 The Problem We state the following definition of feature construction 1 in concept learning: Feature Construction: the application of a set of constructive operators {o 1 , 0 2 , ...o n } to a set of existing features {f 1 ,f 2 …
منابع مشابه
Constructive Induction of Cartesian Product Attributes
Constructive induction is the process of changing the representation of examples by creating new attributes from existing attributes. In classi cation, the goal of constructive induction is to nd a representation that facilitates learning a concept description by a particular learning system. Typically, the new attributes are Boolean or arithmetic combinations of existing attributes and the lea...
متن کاملImplementation of Constructive Induction – A Data Mining Technique in Agro Database for Optimal Solution
This research paper is about constructive induction of data mining (DM) which is used to cope with inadequacy of attributes. Constructive induction is a process of learning concept description that represents best hypothesis in the space. Inductive learning algorithms are increasingly being pressed into service, as data mining and knowledge discovery tools, to detect patterns or regularities in...
متن کاملConstructive Induction On Decision
Selective induction techniques perform poorly when the features are inappropriate for the target concept. One solution is to have the learning system construct new features automatically ; unfortunately feature construction is a diicult and poorly understood problem. In this paper we present a deenition of feature construction in concept learning, and ooer a framework for its study based on fou...
متن کاملCN2-MCI: A Two-Step Method for Constructive Induction
Methods for constructive induction perform an automatic transformation of description spaces if representational shortcomings deteriorate the quality of learning. In the context of concept learning and propositional representation languages, feature construction algorithms have been developed in order to improve the accuracy and to decrease the complexity of hypotheses. Particularly, so-called ...
متن کاملConstructive Induction on Continuous
In this paper we discuss the problem of selecting appropriate operators for constructive induction. We argue that in problems that are described, at least partially, by continuous features, discriminant analysis is a useful tool for constructive induction. This new method for constructive induction was implemented in system Ltree. Ltree is a data driven constructive induction system, able to de...
متن کاملID2-of-3: Constructive Induction of M -of-N Concepts for Discriminators in Decision Trees
We discuss an approach to constructing composite features during the induction of decision trees. The composite features correspond to m-of-n concepts. There are three goals of this research. First, we explore a family of greedy methods for building m-ofn concepts (one of which, GS, is described in this paper). Second, we show how these concepts can be formed as internal nodes of decision trees...
متن کامل